Staphylococcus ratti sp. nov. Isolated from a Lab Rat

Staphylococci from the Staphylococcus intermedius-Staphylococcus hyicus species group include numerous animal pathogens and are an important reservoir of virulence and antimicrobial resistance determinants. Due to their pathogenic potential, they are possible causative agents of zoonoses in humans; therefore, it is important to address the properties of these strains. Here we used a polyphasic taxonomic approach to characterize the coagulase-negative staphylococcal strain NRL/St 03/464T, isolated from the nostrils of a healthy laboratory rat during a microbiological screening of laboratory animals. The 16S rRNA sequence, MALDI-TOF mass spectrometry and positive urea hydrolysis and beta-glucuronidase tests clearly distinguished it from closely related Staphylococcus spp. All analyses have consistently shown that the closest relative is Staphylococcus chromogenes; however, values of digital DNA-DNA hybridization <35.3% and an average nucleotide identity <81.4% confirmed that the analyzed strain is a distinct Staphylococcus species. Whole-genome sequencing and expert annotation of the genome revealed the presence of novel variable genetic elements, including two plasmids named pSR9025A and pSR9025B, prophages, genomic islands and a composite transposon that may confer selective advantages to other bacteria and enhance their survival. Based on phenotypic, phylogenetic and genomic data obtained in this study, the strain NRL/St 03/464T (= CCM 9025T = LMG 31873T = DSM 111348T) represents a novel species with the suggested name Staphylococcus ratti sp. nov.


Introduction
Staphylococci are opportunistic pathogens widespread in nature; they are mainly isolated from the skin, skin glands, and mucous membranes of various animals [1], and less often from the blood of diseased animals [2]. The nasal cavity of many mammalian species is inhabited by distinctive staphylococcal species. Studies of Staphylococcus spp. distribution in wild rodents have shown the presence of predominantly coagulase-negative staphylococci, including Staphylococcus xylosus, Staphylococcus equorum, Staphylococcus succinus, Staphylococcus saprophyticus and Mammaliicoccus spp. [3,4]. The species composition of staphylococcal populations in wild and laboratory rats is largely unknown. The predominant species in laboratory rats is S. xylosus, followed by Staphylococcus aureus and

Phylogenetic Analyses
The isolate CCM 9025 T (= NRL/St 03/464 T ) was assigned by a partial sequencing of the 16S rRNA gene to the Staphylococcus genus and previously defined Hyicus-Intermedius species group [31]; however, its biochemical profile, MALDI-TOF MS pattern and partial RNA polymerase subunit beta (rpoB) gene sequence did not allow for its classification into any known staphylococcal species. Therefore, a polyphasic taxonomic study was conducted which was focused on a detailed characterization of a new Staphylococcus species.
The obtained complete 16S rRNA gene sequence of strain CCM 9025 T was compared to those of other taxa from the Hyicus-Intermedius species group of the Staphylococcus genus. The closest relatives were S. chromogenes (99.3% similarity), S. agnetis (99.2%) and S. hyicus (99.1%); other species were below 98% similarity. The topology of the maximum likelihood phylogenetic tree constructed with 16S rRNA gene sequences was similar to that of the neighbor-joining tree (Figure 1a). (c) (d) Figure 1. Evolutionary analyses of S. hyicus-S. intermedius phylogenetic complex including S. ratti sp. nov. (a) Unrooted phylogenetic tree based on complete 16S rRNA gene sequences extracted from whole-genomic sequencing data (GenBank accession numbers are in parentheses). The evolutionary history was inferred by using the maximum likelihood method and Tamura-Nei model. Filled circles indicate that the corresponding nodes were also obtained in the tree constructed by the neighbor-joining method. The percentage of 500 tree replications above 50% in which the associated taxa clustered together is shown next to the branches. The tree is drawn to scale, with branch lengths measured in the number of substitutions per site. There were a total of 1551 positions in the final dataset. (b) Unrooted maximum likelihood tree based on multilocus sequence analysis of concatenated nucleotide sequences from six loci, rpoB, hsp60, dnaJ, tufA, gap and sodA, extracted from whole genome assemblies (accession numbers are in parentheses). There were a total of 3972 positions in the final dataset. Bootstrap probability values (percentages of 500 tree replications) greater than 50% are shown at branch points. The evolutionary distances are given as the number of substitutions per site. (c) Nucleotide core gene set phylogenetic tree of S. ratti sp. nov. and phylogenetically related species. (d) Protein sequence-based phylogenetic tree of the core gene set of S. ratti sp. nov. and phylogenetically related species. Trees (c) and (d) were constructed using up-to-date bacterial core gene set (UBCG; concatenated alignment of 92 core genes). The maximum likelihood tree was inferred using RAxML software and set to 100 replicates. Gene support indices are given at branching points (maximal possible value is 92). Bar, 0.1 substitution per position.
Because the 16S rRNA analysis has limited discriminatory power for the identification of some staphylococcal species, the phylogenetic position of the new isolates was also assessed using the concatenated multilocus sequence data of six routinely used house- nov. (a) Unrooted phylogenetic tree based on complete 16S rRNA gene sequences extracted from whole-genomic sequencing data (GenBank accession numbers are in parentheses). The evolutionary history was inferred by using the maximum likelihood method and Tamura-Nei model. Filled circles indicate that the corresponding nodes were also obtained in the tree constructed by the neighbor-joining method. The percentage of 500 tree replications above 50% in which the associated taxa clustered together is shown next to the branches. The tree is drawn to scale, with branch lengths measured in the number of substitutions per site. There were a total of 1551 positions in the final dataset. (b) Unrooted maximum likelihood tree based on multilocus sequence analysis of concatenated nucleotide sequences from six loci, rpoB, hsp60, dnaJ, tufA, gap and sodA, extracted from whole genome assemblies (accession numbers are in parentheses). There were a total of 3972 positions in the final dataset. Bootstrap probability values (percentages of 500 tree replications) greater than 50% are shown at branch points. The evolutionary distances are given as the number of substitutions per site. (c) Nucleotide core gene set phylogenetic tree of S. ratti sp. nov. and phylogenetically related species. (d) Protein sequence-based phylogenetic tree of the core gene set of S. ratti sp. nov. and phylogenetically related species. Trees (c) and (d) were constructed using up-to-date bacterial core gene set (UBCG; concatenated alignment of 92 core genes). The maximum likelihood tree was inferred using RAxML software and set to 100 replicates. Gene support indices are given at branching points (maximal possible value is 92). Bar, 0.1 substitution per position.
Because the 16S rRNA analysis has limited discriminatory power for the identification of some staphylococcal species, the phylogenetic position of the new isolates was also assessed using the concatenated multilocus sequence data of six routinely used housekeeping genes ( Figure 1b). The maximum likelihood phylogenetic trees for the housekeeping genes including the closest relatives had a very similar topology which corresponded to that of the 16S rRNA gene tree, clearly separated the novel isolate CCM 9025 T from the established species and which confirmed S. chromogenes as the closest relative. The significant phylogenetic distance from the related staphylococcal species at the whole-genome level, with an average nucleotide identity (ANI) of <81.4% and digital DNA-DNA hybridization (dDDH) of <35.3%, were below the species delineation thresholds, which are 95-96% and 70%, respectively [32]. This confirmed that strain CCM 9025 T represents a distinct Staphylococcus species named Staphylococcus ratti sp. nov. It forms a separate branch in the S. hyicus phylogenetic clade, as also shown by protein coding core genome analysis using the up-to-date bacterial core gene (UBCG) (Figure 1c,d).

Phenotypic, Genotypic and Chemotaxonomic Characteristics
Cells of strain CCM 9025 T are irregular spherical cocci with diameter 705 ± 55 nm (n = 100) ( Figure 2). The strain grew well on common media for staphylococci and was mesophilic and moderately halophilic, with the ability to hydrolyze biomacromolecules (gelatin, DNA). Carbohydrates seldom served as the source of carbon. The detailed phenotype data are subsequently mentioned in the species description in the text; here we only specify several notable results. Interestingly, the hyaluronidase test result was positive for CCM 9025 T , which is a rare feature for coagulase-negative staphylococcal species. During biotyping, a few contradictory results depending on the tested conditions were found. Firstly, the Voges-Proskauer test (acetoin) was negative in a standard tube test, but positive with a commercial VPtest strip containing pyruvic acid instead of glucose. The second inconsistency was related to an enzymatic β-glucuronidase test included in the commercial kits, when β-glucuronidase was positive for CCM 9025 T in STAPHYtest 24, but negative in API ZYM, probably due to different substrates being used for enzyme detection. keeping genes (Figure 1b). The maximum likelihood phylogenetic trees for the housekeeping genes including the closest relatives had a very similar topology which corresponded to that of the 16S rRNA gene tree, clearly separated the novel isolate CCM 9025 T from the established species and which confirmed S. chromogenes as the closest relative. The significant phylogenetic distance from the related staphylococcal species at the whole-genome level, with an average nucleotide identity (ANI) of <81.4% and digital DNA-DNA hybridization (dDDH) of <35.3%, were below the species delineation thresholds, which are 95-96% and 70%, respectively [32]. This confirmed that strain CCM 9025 T represents a distinct Staphylococcus species named Staphylococcus ratti sp. nov. It forms a separate branch in the S. hyicus phylogenetic clade, as also shown by protein coding core genome analysis using the up-to-date bacterial core gene (UBCG) (Figure 1c,d).

Phenotypic, Genotypic and Chemotaxonomic Characteristics
Cells of strain CCM 9025 T are irregular spherical cocci with diameter 705 ± 55 nm (n = 100) ( Figure 2). The strain grew well on common media for staphylococci and was mesophilic and moderately halophilic, with the ability to hydrolyze biomacromolecules (gelatin, DNA). Carbohydrates seldom served as the source of carbon. The detailed phenotype data are subsequently mentioned in the species description in the text; here we only specify several notable results. Interestingly, the hyaluronidase test result was positive for CCM 9025 T , which is a rare feature for coagulase-negative staphylococcal species. During biotyping, a few contradictory results depending on the tested conditions were found. Firstly, the Voges-Proskauer test (acetoin) was negative in a standard tube test, but positive with a commercial VPtest strip containing pyruvic acid instead of glucose. The second inconsistency was related to an enzymatic β-glucuronidase test included in the commercial kits, when β-glucuronidase was positive for CCM 9025 T in STAPHYtest 24, but negative in API ZYM, probably due to different substrates being used for enzyme detection.
The tests distinguishing novel species from closely related staphylococci of the S. hyicus group are shown in Table 1, and these tests enable the correct identification of CCM 9025 T at the species level. Profiling analysis by matrix-assisted laser desorption/ionization-time-of-flight mass spectrometry (MALDI-TOF MS) is routinely used for bacterial identification. A distinct MALDI-TOF MS pattern is therefore a useful phenotypic feature in describing new species. Strain CCM 9025 T generated a consistent MALDI-TOF MS profile containing signals in the mass range of 2-10 kDa, which was not related to any of those Staphylococcus species already represented in the commercial database at the time of testing. After the manual inclusion of profiles from 24 analyses of the strain CCM 9025 T to the in-house The tests distinguishing novel species from closely related staphylococci of the S. hyicus group are shown in Table 1, and these tests enable the correct identification of CCM 9025 T at the species level.
Profiling analysis by matrix-assisted laser desorption/ionization-time-of-flight mass spectrometry (MALDI-TOF MS) is routinely used for bacterial identification. A distinct MALDI-TOF MS pattern is therefore a useful phenotypic feature in describing new species. Strain CCM 9025 T generated a consistent MALDI-TOF MS profile containing signals in the mass range of 2-10 kDa, which was not related to any of those Staphylococcus species already represented in the commercial database at the time of testing. After the manual inclusion of profiles from 24 analyses of the strain CCM 9025 T to the in-house database, the strain was re-analyzed and matched the new pattern with a score of 2.6, whereas the next closest species was S. schleiferi with a score of 1.2, which is far below the species identification threshold.  Staphylococci are easily differentiated from other Gram-positive cocci at the genus level by cellular fatty acid analysis, which is also one of the recommended tests at the species level. The cellular fatty acid profile of strain CCM 9025 T revealed 4 major fatty acids (FAs) (≥10%), iso-C 15:0 (35.0%), anteiso-C 15:0 (24.1%), iso-C 17:0 (10.8%) and anteiso-C 17:0 (10.2%), similar to other validly named Staphylococcus spp. [33]. Comparison to the closest related Staphylococcus spp. showed qualitatively similar profiles of FAs with quantitative differences between the compared type strains ( Table 2). Qualitative differences in iso-C 15:0 , iso-C 17:0 and C 20:0 clearly distinguish strain CCM 9025 T from S. chromogenes CCM 3387 T , and lower amounts of anteiso-C 17:0 are specific for S. agnetis CCM 8869 T . S. hyicus CCM 2368 T has the most similar FA profile to CCM 9025 T , but can be distinguished by lower amounts of branched C 17:0 FAs and a higher amount of iso-C 15:0 . The major respiratory quinone in strain CCM 9025 T was MK-7 (95.7%). Menaquinones MK-6 and MK-8 were also detected as minor components of the electron transport system. The identification of MK-7 as the major component of the quinone system is in accordance with the genus characteristics, as members of the genus Staphylococcus reveal the presence of unsaturated menaquinones, typically with six, seven or eight isoprene units [33,34]. Analysis of the peptidoglycan structure revealed a cross-linkage type A structure of A3α L-Lys-Gly 3-4 similar to the type A11.2 structure [35], with the molar amino acid ratio 2.0 Ala:3.4 Gly:1.0 Glu:0.8 Lys. The type A11.2 peptidoglycan structure was also identified in the closest related species S. chromogenes, S. agnetis and S. hyicus, as described by Schumann [35]. Unlike CCM 9025 T and S. agnetis, S. chromogenes and S. hyicus were found to also contain minor amounts of serine, likely substituting for some glycine in the interpeptide bridge [24,33,36].
A DNA fingerprinting technique using repetitive sequence-based PCR (rep-PCR) fingerprinting with the (GTG) 5 primer, previously shown to be suitable for the simultaneous detection and differentiation of Staphylococcus spp., was used to demonstrate the difference of strain CCM 9025 T from related taxa. The rep-PCR clearly distinguished the analyzed strain from the type strains representing the phylogenetically close Staphylococcus spp. (Figure 3). transport system. The identification of MK-7 as the major component of the quinone system is in accordance with the genus characteristics, as members of the genus Staphylococcus reveal the presence of unsaturated menaquinones, typically with six, seven or eight isoprene units [33,34]. Analysis of the peptidoglycan structure revealed a cross-linkage type A structure of A3α L-Lys-Gly3-4 similar to the type A11.2 structure [35], with the molar amino acid ratio 2.0 Ala : 3.4 Gly : 1.0 Glu : 0.8 Lys. The type A11.2 peptidoglycan structure was also identified in the closest related species S. chromogenes, S. agnetis and S. hyicus, as described by Schumann [35]. Unlike CCM 9025 T and S. agnetis, S. chromogenes and S. hyicus were found to also contain minor amounts of serine, likely substituting for some glycine in the interpeptide bridge [24,33,36].
A DNA fingerprinting technique using repetitive sequence-based PCR (rep-PCR) fingerprinting with the (GTG)5 primer, previously shown to be suitable for the simultaneous detection and differentiation of Staphylococcus spp., was used to demonstrate the difference of strain CCM 9025 T from related taxa. The rep-PCR clearly distinguished the analyzed strain from the type strains representing the phylogenetically close Staphylococcus spp. (Figure 3).

Whole Genome Characterization of Staphylococcus ratti sp. nov.
The genome of S. ratti sp. nov. type strain CCM 9025 T was sequenced using Illumina and Oxford Nanopore platforms. The size of the complete chromosome assembly is 2.3 Mb with a mean coverage of 500-fold. Based on the NCBI automated annotation pipeline, a total of 2198 CDSs were identified in the genome, of which 2150 were protein-encoding genes. A total of 82 genes for RNAs were identified in the genome, including 59 tRNAs, 19 rRNAs including 7 (5S), 6 (16S) and 6 (23S), and 4 ncRNAs. Two plasmid sequences named pSR9025A (3311 bp) and pSR9025B (2455 bp) were assembled and annotated as separate extrachromosomal replicons. Plasmid pSR9025A encodes the gene for the Rep protein, which shares 100% amino acid identity with the Rep protein gene in p908 of S. agnetis [37] and a gsiB (glucose starvation-inducible protein B) gene homologue which is involved in a stress response. Further short similar regions were identified in the plasmids of many other coagulase-negative staphylococci. The plasmid pSR9025B is a cryptic plasmid similar (75.5% identity and 27% coverage) to the pLNU9 plasmid of S.

Whole Genome Characterization of Staphylococcus ratti sp. nov.
The genome of S. ratti sp. nov. type strain CCM 9025 T was sequenced using Illumina and Oxford Nanopore platforms. The size of the complete chromosome assembly is 2.3 Mb with a mean coverage of 500-fold. Based on the NCBI automated annotation pipeline, a total of 2198 CDSs were identified in the genome, of which 2150 were protein-encoding genes. A total of 82 genes for RNAs were identified in the genome, including 59 tRNAs, 19 rRNAs including 7 (5S), 6 (16S) and 6 (23S), and 4 ncRNAs. Two plasmid sequences named pSR9025A (3311 bp) and pSR9025B (2455 bp) were assembled and annotated as separate extrachromosomal replicons. Plasmid pSR9025A encodes the gene for the Rep protein, which shares 100% amino acid identity with the Rep protein gene in p908 of S. agnetis [37] and a gsiB (glucose starvation-inducible protein B) gene homologue which is involved in a stress response. Further short similar regions were identified in the plasmids of many other coagulase-negative staphylococci. The plasmid pSR9025B is a cryptic plasmid similar (75.5% identity and 27% coverage) to the pLNU9 plasmid of S. chromogenes [38]. This is an indication of interspecies transfer of these types of variable genetic elements (VGEs).
Comparative genomic analysis of S. ratti with the two type strains S. chromogenes NCTC 10530 T and S. hyicus ATCC 11249 T revealed the presence of additional VGEs, including insertion sequence elements, a composite transposon, one prophage, and a genomic island (Figure 4). Comparative genomic analysis of S. ratti with the two type strains S. chromogenes NCTC 10530 T and S. hyicus ATCC 11249 T revealed the presence of additional VGEs, including insertion sequence elements, a composite transposon, one prophage, and a genomic island (Figure 4). Type I restriction-modification (RM) system genes were found downstream of orfX, but no evidence of staphylococcal cassette chromosome (SCC) integration was found. Type II RM system genes are localized near a cap operon. The clustered, regularly interspaced, short, palindromic repeats (CRISPR)/CRISPR-associated gene (Cas) system was identified in the same location in both the S. ratti and S. hyicus genomes. The genes for predicted virulence factors, surface and extracellular proteins found in S. ratti CCM 9025 T genome are shown in Table 3.
The accessory genome is often associated with virulence and antimicrobial resistance and has an important role in the ability of species to colonize particular hosts or persist in the environment. To determine the genomic diversity within all Hyicus-Intermedius species group representatives, the pangenome was analyzed. Type strains within the group have shown an extensive accessory genome whose profile correlates well with the phylogenetic relationship of individual species ( Figure 5). The comparison indicates a distinct accessory genes group which is shared with S. hyicus, S. chromogenes and S. agnetis species ( Figure 5).  Type I restriction-modification (RM) system genes were found downstream of orfX, but no evidence of staphylococcal cassette chromosome (SCC) integration was found. Type II RM system genes are localized near a cap operon. The clustered, regularly interspaced, short, palindromic repeats (CRISPR)/CRISPR-associated gene (Cas) system was identified in the same location in both the S. ratti and S. hyicus genomes. The genes for predicted virulence factors, surface and extracellular proteins found in S. ratti CCM 9025 T genome are shown in Table 3. The accessory genome is often associated with virulence and antimicrobial resistance and has an important role in the ability of species to colonize particular hosts or persist in the environment. To determine the genomic diversity within all Hyicus-Intermedius species group representatives, the pangenome was analyzed. Type strains within the group have shown an extensive accessory genome whose profile correlates well with the phylogenetic relationship of individual species ( Figure 5). The comparison indicates a distinct accessory genes group which is shared with S. hyicus, S. chromogenes and S. agnetis species ( Figure 5).  The genome harbors various types of insertion sequences (ISs) from the IS200/IS605, IS3 and IS6 families. In addition to the above ISs, a 9.1-kb composite transposon with 28.3% G+C content is integrated into the genome and flanked by two elements from the IS3 family (Figure 4). The transposon encodes a gene for the radical S-adenosyl-Lmethionine (SAM) enzyme (LN051_08115), which is involved in a number of metabolic processes, including post-transcriptional and post-translational modifications, and a gene encoding a YcaO-like protein (LN051_08100) which is responsible for the synthesis of thiazole/oxazole-modified microcin antibiotics [39]. The SagB/ThcOx family dehydrogenase gene (LN051_08090) for a membrane-associated N-acetylglucosaminidase that cleaves polymerized glycan strands to their physiological length and a major facilitator superfamily (MFS) transporter gene (LN051_08095), are also part of the transposon. Since these genes are linked to modulating antibiotic resistance in methicillinresistant S. aureus [40], we hypothesize that these genes may be responsible for the penicillin resistance of this strain.
One prophage designated vB_SraS_LR1 with a typical siphoviral modular structure is integrated at the 13-bp-long putative att site 5′ AAAATCAACYTTT 3′ adjacent to the tRNA Arg gene (locus tag LN051_09185) and exhibits 75.9% identity and 43% coverage with the S. hyicus phage EW (= RG = NCTC 9856) [41], which was previously misclassified according to its genomic sequence [42] as an S. aureus bacteriophage belonging to the Phietavirus genus [43]. Moderate similarity was found (77.3% identity and 30% coverage) to the S. hyicus phage PMBT9 from the Siphoviridae family [44].
A 15-kb long phage-inducible chromosomal island designated SrRICCM9025 was identified in the genome of CCM 9025 T (Figure 4). It has 30.1% G+C content and harbours a site-specific integrase (LN051_02785), IS6 family transposase (LN051_02790), the gene virE encoding virulence-associated E family protein (LN051_02795), and new putative phosphotransferase genes which may be related to antimicrobial resistance. SrRICCM9025 exhibits partial sequence similarity to the Macrococcus island McRImsr [45] in the proteins The genome harbors various types of insertion sequences (ISs) from the IS200/IS605, IS3 and IS6 families. In addition to the above ISs, a 9.1-kb composite transposon with 28.3% G+C content is integrated into the genome and flanked by two elements from the IS3 family ( Figure 4). The transposon encodes a gene for the radical S-adenosyl-L-methionine (SAM) enzyme (LN051_08115), which is involved in a number of metabolic processes, including post-transcriptional and post-translational modifications, and a gene encoding a YcaO-like protein (LN051_08100) which is responsible for the synthesis of thiazole/oxazole-modified microcin antibiotics [39]. The SagB/ThcOx family dehydrogenase gene (LN051_08090) for a membrane-associated N-acetylglucosaminidase that cleaves polymerized glycan strands to their physiological length and a major facilitator superfamily (MFS) transporter gene (LN051_08095), are also part of the transposon. Since these genes are linked to modulating antibiotic resistance in methicillin-resistant S. aureus [40], we hypothesize that these genes may be responsible for the penicillin resistance of this strain.
One prophage designated vB_SraS_LR1 with a typical siphoviral modular structure is integrated at the 13-bp-long putative att site 5 AAAATCAACYTTT 3 adjacent to the tRNA Arg gene (locus tag LN051_09185) and exhibits 75.9% identity and 43% coverage with the S. hyicus phage EW (= RG = NCTC 9856) [41], which was previously misclassified according to its genomic sequence [42] as an S. aureus bacteriophage belonging to the Phietavirus genus [43]. Moderate similarity was found (77.3% identity and 30% coverage) to the S. hyicus phage PMBT9 from the Siphoviridae family [44].
A 15-kb long phage-inducible chromosomal island designated SrRI CCM9025 was identified in the genome of CCM 9025 T (Figure 4). It has 30.1% G+C content and harbours a site-specific integrase (LN051_02785), IS6 family transposase (LN051_02790), the gene virE encoding virulence-associated E family protein (LN051_02795), and new putative phosphotransferase genes which may be related to antimicrobial resistance. SrRI CCM9025 exhibits partial sequence similarity to the Macrococcus island McRI msr [45] in the proteins for integration and replication but otherwise demonstrates a distinct gene structure ( Figure 6). It is integrated adjacent to the lacA, lacB, lacD, lacG genes and genes for lactose/cellobiosespecific phosphotransferase system genes, which are required for galactose 6-phosphate isomerase activity, described as part of the S. aureus lactose operon lacABCDFEG [46]. Pathogens 2022, 11, x FOR PEER REVIEW 9 of 17 specific phosphotransferase system genes, which are required for galactose 6-phosphate isomerase activity, described as part of the S. aureus lactose operon lacABCDFEG [46]. Despite the fact that mobile elements were found in the genome, the identification of the CRISPR/Cas system in the S. ratti CCM 9025 T genome is consistent with the need to limit the uptake of foreign DNA. Strain CCM 9025 T carries a 431-bp-long CRISPR loci with 6 spacers flanked by 36-bp-long direct repeats (DR: GTTTTAGTACTCTGTAATTTTAG-TATAAGGTATTC) and putative cas genes encoding CRISPR-associated endonuclease Cas9 and the proteins Cas1, Cas2 and Csn1 typical for CRISPR system type II-A. Cas9 exhibited 79% amino acid identity to the type II CRISPR-associated Cas9 of S. agnetis (WP_107390356) [47] and S. chromogenes (WP_145399953), and 74% amino acid identity to Cas9 of S. hyicus ATCC 11249 T (WP_167696241) [48]. Spacers target bacteriophage-related sequences, but no significant similarity to staphylococcal phage genomes was found, indicating a gap in the knowledge of phages infecting this host.

Bacterial Strains and Cultivation
The isolate NRL St 03/464 T was isolated from a single animal and maintained as glycerol stock at −70 • C until analyzed. Reference strains of the phylogenetic relatives S. agnetis CCM 8869 T , S. hyicus CCM 2368 T , and S. chromogenes CCM 3387 T were obtained from the Czech Collection of Microorganisms (Brno, Czech Republic). All cultivations were performed at 30 • C for 24 h unless stated otherwise in the test specifications. Type strain Staphylococcus ratti NRL/St 03/464 T has been deposited as publicly accessible in the Czech Collection of Microorganisms (CCM), the German Collection of Microorganisms and Cell Cultures (DSMZ), and the BCCM/LMG Bacteria collection.

Transmission Electron Microscopy
A 200-mesh carbon/formvar-coated grid was placed on a drop of suspension of bacteria in water for 20 min. Bacterial cells located on the grid were negatively stained with 2% ammonium molybdate and treated with UV light. A Morgagni 268D Philips (ThermoFisher Scientific, Amsterdam, The Netherlands) transmission electron microscope was used to visualize bacterial cells.

Matrix-Assisted Laser Desorption/Ionization Time-of-Flight Mass Spectrometry (MALDI-TOF MS) Analysis
MALDI-TOF MS was performed in automatic acquisition mode as described previously [53] in a Microflex LT MALDI-TOF spectrometer (Bruker Daltonics, Bremen, Germany) by using MBT Compass 4.1 software (Bruker Daltonics). Identification was made using the Bruker's database MBT Compass Library Revision L 2020 (9607 MSP).

Chemotaxonomic Analyses
Cellular fatty acids (FAs) were extracted from all compared Staphylococcus strains grown on the Trypticase soy broth agar (TSBA) plates under the same cultivation conditions with a cultivation temperature of 37 • C for 24 h to reach the late-exponential stage of growth according to the four-quadrant streak method. The extraction of cellular FAs was performed according to the standard protocol recommended by the MIDI Microbial Identification System [54]. Extracted FAs were identified using an Agilent 7890B gas chromatograph (Agilent Technologies, Santa Clara, CA, USA) according to the Standard Protocol of the Sherlock Identification System (MIDI Sherlock version 6.2, MIDI database RTSBA version 6.21).
Isolation and structure analyses of the peptidoglycan were performed according to published protocols and some modifications [35,55,56]. In brief, the amino acid composition of total hydrolysate (4 N HCl at 100 • C for 16 h) of the peptidoglycan was analyzed by gas chromatography/mass spectrometry (protocol 10 [35]). The partial hydrolysate (4 N HCl, 100 • C, 45 min) of the peptidoglycan was analyzed by high-resolution liquid chromatography mass spectrometry (LC-MS) as described in [35,55]. Enantiomeric analysis was performed by liquid chromatography as described recently [56].
Respiratory quinones were extracted and analyzed as described previously [57]. Their identity was confirmed by mass spectrometry as described previously [55].

Phylogenetic Analysis Based on 16S rRNA, Housekeeping Genes and Core Genome
The 16S rRNA gene sequences were amplified from crude boiled cell extracts and sequenced by Sanger sequencing in the Eurofins MWG Operon sequencing facility (Ebersberg, Germany) with previously described primers [59]. The partial rpoB gene was sequenced as described previously [60]. Initial identification of the strain to the genus level was based on pairwise sequence alignment and calculation of similarity values with the algorithm used in the EzBioCloud database v.2021.07.07 [61]. 16S rRNA gene sequences obtained from PCR products were aligned with those extracted from WGS data using RNAmmer version 1.2 [62]. The multilocus sequence data of six housekeeping genes that are commonly used in phylogenetic studies of the Staphylococcus genus were extracted from whole-genome sequence assemblies of type strains available in the NCBI Assembly resource [63], including the NCTC 3000 project (https://www.sanger.ac.uk/resources/downloads/bacteria/nctc/ accessed on 27 October 2021) and FDA-ARGOS project [64]. The partial gene sequences used correspond to the following gene coordinates of S. aureus: 1420..1974 for rpoB, 270..826 for groEL, 23..911 for dnaJ, 49..929 for gap, 383..1032 for tufA, and 50..480 for the sodA gene.
The phylogenetic analyses were performed with the software MEGA X [65]. Genetic distances were corrected using the Tamura-Nei model [66], and the evolutionary history was inferred using the maximum likelihood (ML) and neighbor-joining (NJ) methods using a bootstrap test based on 500 replications [67]. The up-to-date bacterial core gene (UBCG) pipeline version 3.0 was used for whole-genome phylogenetic analysis based on core gene sequences [68].

Genome Sequencing and Bioinformatics Analyses
Total genomic DNA was extracted using a GenElute Bacterial Genomic DNA kit (Sigma-Aldrich, St. Louis, MO, USA) from pure culture colonies cultivated on Colombia sheep blood agar (Oxoid). The preparation of DNA libraries with a Nextera XT DNA Library Preparation Kit (Illumina, San Diego, CA, USA) and whole-genome sequencing on the Illumina platform were conducted externally (LGC Genomics, Berlin, Germany) using 2 × 150 bp paired-end reads on the NextSeq sequencing platform (Illumina).
For sequencing using the Oxford Nanopore platform, bacterial DNA was isolated as described previously [71]. The library was prepared using the SQK-RBK004 rapid barcoding kit (Oxford Nanopore Technologies, Oxford, UK) according to the manufacturer's instructions. Libraries were sequenced with FLO-MIN106 flow cells (R9.4.1) in a MinION device (Oxford Nanopore Technologies, Oxford, UK). The device was controlled with the software MinKNOW version 4.1.2 (Oxford Nanopore Technologies, Oxford, UK).
Basecalling, demultiplexing and barcode trimming were performed using standalone ONT Guppy software version 5.0.11 using the config file dna_r9.4.1_450bps_sup.cfg with the default minimum q-score threshold, i.e., 10. The MinION reads were subsequently filtered by mapping to Illumina reads using Filtlong version 0.2.1 (https:// github.com/rrwick/Filtlong accessed on 20 September 2021) with a minimum length of 1500 bp and quality threshold set to 95%. Only data that exceeded these thresholds was used in the assembly. The quality of reads was assessed with FastQC version 0.11.9 (http://www.bioinformatics.babraham.ac.uk/projects/fastqc accessed on 20 September 2021) and NanoStat [72]. Complete bacterial genome sequences were obtained using a hybrid assembly with Unicycler version 0.4.9 [73] using SPAdes version 3.12.0 [74], and the parameters chosen were bold mode and k-mers 21,55,77,99,127. The resulting assembly was polished with Pilon version 1.24 [75].
For pangenome analysis, the complete genomes were initially annotated with Prokka version 1.14.6 [76] and clustered with Roary [77]; the results were then visualized with the script roary_plots.py, which is provided in the Roary package. Further, the genome was annotated using the NCBI Prokaryotic Genome Annotation Pipeline [78]. Sequences were manipulated and inspected in the cross-platform bioinformatics software Ugene version 38.1 [79]. The multiple sequence alignment was visualised using EasyFig version 2.2.5 [80]. Gene content was further examined manually with NCBI BLAST (https://blast.ncbi.nlm. nih.gov accessed on 4 November 2021), and VGEs were identified with PHASTER [81], PhiSpy version 3.4 [82], IslandViewer 4 [83], and ISFinder [84]. The CRISPR/Cas system was characterized with CRISPRCasTyper [85]. Virulence factors were predicted using the VFanalyzer tool available at the Virulence Factors Database [86].